Prediction of the Isoelectric Point of an Amino Acid Based on GA-PLS and SVMs
نویسندگان
چکیده
The support vector machine (SVM), as a novel type of a learning machine, for the first time, was used to develop a QSPR model that relates the structures of 35 amino acids to their isoelectric point. Molecular descriptors calculated from the structure alone were used to represent molecular structures. The seven descriptors selected using GA-PLS, which is a sophisticated hybrid approach that combines GA as a powerful optimization method with PLS as a robust statistical method for variable selection, were used as inputs of RBFNNs and SVM to predict the isoelectric point of an amino acid. The optimal QSPR model developed was based on support vector machines, which showed the following results: the root-mean-square error of 0.2383 and the prediction correlation coefficient R=0.9702 were obtained for the whole data set. Satisfactory results indicated that the GA-PLS approach is a very effective method for variable selection, and the support vector machine is a very promising tool for the nonlinear approximation.
منابع مشابه
Characteristics Determination of Rheb Gene and Protein in Raini Cashmere Goat
The aim of the present study was todeterminecharacteristics of Rheb gene and protein in Raini Cashmere goat. Comparative analyses of the nucleotide sequences were performed. Open reading frames (ORFs), theoretical molecular weights of deduced polypeptides, the protein isoelectric point, protein characteristics and three-dimensional structures was predicted using online standard softwares. The f...
متن کاملMolecular Characterization of a Three-disulfide Bridges Beta-like Neurotoxin from Androctonus crassicauda Scorpion Venom
Scorpion venom is the richest source of peptide toxins with high levels of specific interactions with different ion-channel membrane proteins. The present study involved the amplification and sequencing of a 310-bp cDNA fragment encoding a beta-like neurotoxin active on sodium ion-channel from the venom glands of scorpion Androctonus crassicauda belonging to the Buthidae family using r...
متن کاملBroiler Diets Formulated Based on Digestible Amino Acid Values as Determined by in vivo and Prediction Methods
The aim of the present study was to assess whether near infrared reflectance spectroscopy (NIRS) and regression equations are the practical and accurate approach of nutritional assessment of common feedstuffs. Therefore two experiments were conducted to study the effect of amino acid determination methods on broiler performance. In experiment I, two hundred thirty four male Ross broiler chicks ...
متن کاملIn silico comparison of Iranian HIV -1 envelop glycoprotein with five nearby countries
HIV-1 envelope (env) glycoprotein mediates an important role in entry of the virus into the susceptible target cells. As env glycoprotein of HIV-1 is highly variable in the different geographical regions, in the present study, different properties of this protein in Iran are compared with five nearby countries. The sequences of HIV-1 env glycoproteins of Iran, Afghanistan, Russia, Turkey, Pakis...
متن کاملAmino acid and fatty acid profiles of materials recovered from Prussian carp, Carassius gibelio (Bloch, 1782), using acidic and basic solubilization/ precipitation technique
Isoelectric solubilization /precipitation (ISP) process was used to isolate protein from muscles of Prussian carp, Carassius gibelio (Bloch, 1782). Fish protein and lipid were recovered from whole gutted Prussian carp using acidic and basic isoelectric solubilization/precipitation followed by assaying amino acid and fatty acid profile. Essential amino acids content in acidic and basic pH treatm...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Journal of chemical information and computer sciences
دوره 44 1 شماره
صفحات -
تاریخ انتشار 2004